Declarative sentences:

Length | Sentence |
---|---|
15 | Yen Jaya Kudus. |
15 | Tapi Riyadi SH. |
15 | Bisa jadi riya. |
16 | Kurang dahareun. |
16 | Jaya Setia, Kec. |
17 | Sulawesi Selatan. |
17 | Cempaka Putih Jl. |
18 | Tanjung Barat Kec. |
18 | Harapan Jaya, Kec. |
18 | Sek biasa wae lah. |

Exclamatory sentences:

Length | Sentence |
---|---|
20 | Luar biasa hari ini! |
22 | Dapatkan aplikasi ini! |
24 | Ganti jaman ganti lakon! |
35 | Tong pedah tiris jadi tiiseun yeuh! |
36 | Hemat waktu dan kontak sekolah sini! |
37 | Matak biayana oge kumaha koprasi bae! |
38 | ! cari solusi lah jgn seenaknya terus! |

Interrogative sentences:

Length | Sentence |
---|---|
26 | Apakah masih bisa diralat? |
29 | Nanti saya tes nya ke jkt ya? |
30 | Apakah harus datang ke kampus? |
31 | Terus yang di nilai Asesor apa? |
33 | Dengan akibat apa atau hasil apa? |
35 | Apakah mungkin saya bisa bergabung? |
38 | Na saha nu ngomongkeun tukang seureuh? |

The three tables above list the shortest sentences in the corpus, grouped into declarative, exclamatory, and interrogative sentences.
Such sentences give some insight into the language and the corpus. Malformed sentences, in particular, can point to ways of improving the preprocessing.
Only sentences that were accepted by the preprocessing appear here. Language detection usually requires a minimum number of known words, so some very short sentences may be missing from the corpus.
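
To illustrate why very short sentences can drop out, here is a minimal sketch of a known-word threshold. It is not the corpus's actual language detector: the function names, the `min_known` threshold, and the tiny `KNOWN_WORDS` set are all hypothetical and chosen only for illustration.

```python
import re

def count_known_words(sentence, known_words):
    """Count the tokens of `sentence` that occur in `known_words`."""
    tokens = re.findall(r"\w+", sentence.lower())
    return sum(1 for token in tokens if token in known_words)

def accepted_by_language_filter(sentence, known_words, min_known=2):
    """Accept a sentence only if it has at least `min_known` known words.

    Assumption: a plain count threshold; real detectors are more elaborate.
    """
    return count_known_words(sentence, known_words) >= min_known

# Hypothetical word list, far smaller than any real lexicon.
KNOWN_WORDS = {"apakah", "masih", "bisa", "hari", "ini"}

print(accepted_by_language_filter("Apakah masih bisa diralat?", KNOWN_WORDS))
print(accepted_by_language_filter("Bisa.", KNOWN_WORDS))
```

Under these assumptions, a one-word sentence can never reach a threshold of two known words, however common that word is, so it would be rejected before it reaches the corpus.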
The following query retrieves the exclamatory sentences shorter than 40 characters:
select char_length(sentence) as le, sentence from sentences where sentence like '%!' and char_length(sentence) < 40 order by le limit 15;
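
The same selection can be tried on a small sample. The sketch below uses an in-memory SQLite database as a stand-in for the real corpus database, so it uses SQLite's `length()`, which counts characters for text values. The helper names `build_demo_db` and `shortest_sentences` are hypothetical, and the endings `.` and `?` for the other two tables are an assumption; only the `!` query appears in the source.

```python
import sqlite3

def build_demo_db():
    """Create an in-memory table `sentences` with a few rows from the tables above."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sentences (sentence TEXT)")
    rows = [
        "Yen Jaya Kudus.",
        "Kurang dahareun.",
        "Luar biasa hari ini!",
        "Dapatkan aplikasi ini!",
        "Apakah masih bisa diralat?",
        "x" * 50 + "!",  # synthetic filler, too long to pass the length filter
    ]
    conn.executemany("INSERT INTO sentences VALUES (?)", [(s,) for s in rows])
    return conn

def shortest_sentences(conn, ending, max_len=40, limit=15):
    """Return (length, sentence) pairs for the shortest sentences ending in `ending`."""
    query = (
        "SELECT length(sentence) AS le, sentence FROM sentences "
        "WHERE sentence LIKE ? AND length(sentence) < ? "
        "ORDER BY le, sentence LIMIT ?"
    )
    return conn.execute(query, ("%" + ending, max_len, limit)).fetchall()

conn = build_demo_db()
for ending in (".", "!", "?"):
    for length, sentence in shortest_sentences(conn, ending):
        print(length, sentence)
```

Passing the ending as a bound parameter keeps one query for all three sentence types. Sorting by `le, sentence` makes the order of equally long sentences deterministic, which the original query leaves open.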
4.1.2 Sentences of fixed length I
4.1.3 Sentences of fixed length II
4.1.4 Sentences of fixed length III
4.1.5 Longest sentences